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ABSTRACT. We consider the nearest-neighbor simple random walk on Z d , d > 2, driven 
by a field of i.i.d. random nearest-neighbor conductances tv X y G [0, 1] . Apart from the 
requirement that the bonds with positive conductances percolate, we pose no restriction 
on the law of the a/s. We prove that, for a.e. realization of the environment, the path 
distribution of the walk converges weakly to that of non-degenerate, isotropic Brownian 
motion. The quenched functional CLT holds despite the fact that the local CLT may fail 
in d > 5 due to anomalously slow decay of the probability that the walk returns to the 
starting point at a given time. 

1. Introduction 

Let Brf denote the set of unordered nearest-neighbor pairs (i.e., edges) of Z d and let 
(cVb)beB d be i.i.d. random variables with <x>i, € [0, 1] . We will refer to tut, as the conductance 
of the edge b. Let P denote the law of the co's and suppose that 

F(cv b > 0) > p c (d), (1.1) 

where p c (d) is the threshold for bond percolation on in d = 1 we have p c (d) = 1 so 
there we suppose <x>t, > a.s. This condition ensures the existence of a unique infinite 
connected component 'tfoo of edges with strictly positive conductances; we will typically 
restrict attention to a/s for which contains a given site (e.g., the origin). 

Each realization of can be used to define a random walk X = (X„) which moves 
about 'rfco by picking, at each unit time, one of its 2d neighbors at random and passing to 
it with probability equal to the conductance of the corresponding edge. Technically, X is 
a Markov chain with state space and the transition probabilities defined by 

PcoA^n+i = y\X„ = x):=^- (1.2) 



if x,y G ^oo and |x — y| = 1, and 

r\Y.. = rV= 1 - 

2d 



Pw,z(Xn+l = x\X n = x):=l-^ £ co xy . (1.3) 

y- |y-*l=i 
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The second index on P W/Z marks the initial position of the walk, i.e., 

Po,, z (X = z) := 1. (1.4) 

The counting measure on is invariant and reversible for this Markov chain. 

The d = 1 walk is a simple, but instructive, exercise in harmonic analysis of reversible 
random walks in random environments. Let us quickly sketch the proof of the fact that, 
for a.e. to sampled from a translation-invariant, ergodic law on (0, l] Brf satisfying 



(— ) < °°, (1-5) 

\COhJ 



E 

\<x> h 



the walk scales to Brownian motion under the usual diffusive scaling of space and time. 
(Here and henceforth E denotes expectation with respect to the environment distribu- 
tion.) The derivation works even for unbounded conductances provided dl.2Hl.3l ) are 
modified accordingly. 

Abbreviate C := M(l/coj,). The key step of the proof is to realize that 

fM) := x + 1 £ (— !— - C) (1.6) 

is harmonic for the Markov chain. Hence cp w {X n ) is a martingale whose increments are, 
by (|1.5|) and a simple calculation, square integrable in the sense 

EE^of^Xi) 2 ] <oo. (1.7) 

Invoking the stationarity and ergodicity of the Markov chain on the space of environ- 
ments "from the point of view of the particle" — we will discuss the specifics of this ar- 
gument later — the martingale (<p w (X n )) satisfies the conditions of the Lindeberg-Feller 
martingale functional CLT and so the law of t i— > tp w (X| wt | ) / \pn tends weakly to that of a 
Brownian motion with diffusion constant given by (L7J). By the Pointwise Ergodic Theo- 
rem and (|1.5[) we have f w (x) — x = o(x) as \x\ — > oo. Thus the path t i— > X^ ni j / y/n scales, 
in the limit n — > oo, to the same function as the deformed path t i— > <p w (X^ nf j )/\/n. As 
this holds for a.e. a;, we have proved a quenched functional CLT. 

While the main ideas of the above d = 1 solution work in all dimensions, the situation 
in d > 2 is, even for i.i.d. conductances, significantly more complicated. Progress has 
been made under additional conditions on the environment law. One such condition is 

strong ellipticity, 

3a > : P(a < cv b < V a ) = 1. (1.8) 

Here an annealed invariance principle was proved by Kipnis and Varadhan [20] and its 
queneched counterpart by Sidoravicius and Sznitman [29 J. Another natural family of 
environments are those arising from supercritical bond percolation on Z d for which (tOf,) 
are i.i.d. zero-one valued with ¥(cOj, = 1) > p c (d). For these cases an annealed in- 
variance principle was proved by De Masi, Ferrari, Goldstein and Wick fill [12| and 
the quenched case was established in d > 4 by Sidoravicius and Sznitman ||29| , and in 
all d > 2 by Berger and Biskup HI and Mathieu and Piatnitski p5| . 
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A common feature of the latter proofs is that, in d > 3, they require the use of heat- 
kernel upper bounds of the form 



where C\, C2 are absolute constants and n is assumed to exceed a random quantity de- 
pending on the environment in the vicinity of x and y. These were deduced by Bar- 
low |2l using sophisticated arguments that involve isoperimetry regular volume growth 
and comparison of graph-theoretical and Euclidean distances for the percolation cluster. 
While the use of \\S) is conceptually rather unsatisfactory — one seems to need a local- 
CLT level of control to establish a plain CLT — no arguments (in d > 3) that avoid 
heat-kernel estimates are known at present. 

The reliance on heat-kernel bounds also suffers from another problem: (|1.9|) may ac- 
tually fail once the conductance law has sufficiently heavy tails at zero. This was first 
noted to happen by Fontes and Mathieu [14J for the heat-kernel averaged over the envi- 
ronment; the quenched situation was analyzed recently by Berger, Biskup, Hoffman and 
Kozma 0. The main conclusion of (7| is that the diagonal (i.e., x = y) bound in \1.9) 
holds in d = 2, 3 but the decay can be slower than any o(n~ 2 ) sequence in d > 5. (The 
threshold sequence in d = 4 is presumably o(n~ 2 log n).) This is caused by the existence 
of traps that may capture the walk for a long time and thus, paradoxically, increase its 
chances to arrive back to the starting point. 

The aformentioned facts lead to a natural question: In the absence of heat-kernel esti- 
mates, does the quenched CLT still hold? Our answer to this question is affirmative and 
constitutes the main result of this note. Another interesting question is what happens 
when the conductances are unbounded from above; this is currently being studied by 
Barlow and Deuschel (3J . 

Note: While this paper was in the process of writing, we received a preprint from Pierre 
Mathieu [24J in which he proves a result that is a continuous-time version of our main 
theorem. The strategy of [24J differs from ours by the consideration of a time-changed 
process (which we use only marginally) and proving that the "new" and "old" time 
scales are commensurate. Our approach is focused on proving the (pointwise) sublin- 
earity of the corrector and it streamlines considerably the proof of |6] in d > 3 in that it 
limits the use of "heat-kernel technology" to a uniform bound on the heat-kernel decay 
(implied by isoperimetry) and a diffusive bound on the expected distance travelled by 
the walk (implied by regular volume growth). 



Let O := [0, l] Brf be the set of all admissible random environments and let P be an i.i.d. 
law on Q. Assuming let ffoo denote the a.s. unique infinite connected component 
of edges with positive conductances and introduce the conditional measure 





(1.9) 



2. Main results and outline 



Po(-) :=P(-|0e#oo). 



(2.1) 
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For T > 0, let ( C [0, T] , Wj ) be the space of continuous functions / : [0, T] — > R rf equipped 
with the Borel c-algebra defined relative to the supremum topology. 
Here is our main result: 

Theorem 2.1 Suppose d > 2 and F(to b > 0) > p c (d). For to £ {0 G ffco}, let (X n ) n >o fee 
£/ze random walk with law P W/ o and let 

B n (t) := -j=(X [tn \ + (tn - LtnJ)(X LfnJ+1 - X [tn] )), t > 0. (2.2) 



Then for allT > and for F^-almost every to, the law of (B„(t) : < t < T) on (C[0, T], #r) 
converges, as n — > oo, weakly to the law of an isotropic Brownian motion (B t : < t < T) with 
a positive and finite diffusion constant (which is independent of to). 

Using a variant of J6l Lemma 6.4], from here we can extract a corresponding conclu- 
sion for the "agile" version of our random walk (cf [6, Theorem 1.2]) by which we mean 
the walk that jumps from x to its neighbor y with probability to XXJ l 7t u (x) where tc w (x) 
is the sum of to xz over all of the neighbors z of x. Replacing discrete times by sums of 
i.i.d. exponential random variables, these invariance principles then extend also to the 
corresponding continuous-time processes. Theorem 12.11 of course implies also an an- 
nealed invariance principle, which is the above convergence for the walk sampled from 
the path measure integrated over the environment. 

Remark 2.2 As we were reminded by Y. Peres, the above functional CLT automatically 
implies the "usual" lower bound on the heat-kernel. Indeed, the Markov property and 
reversibility of X yield 

Po;,o(X 2M = 0) > £ Po,,o(X„ = xf. (2.3) 

Cauchy-Schwarz then gives 

PcodXm = 0) > P W;0 (|X n | < v^)' nr V r\dV {2A) 

Now Theorem 12. 1 1 implies that P w ,o(|X n | < ^fn) is uniformly positive as n — > oo and the 
Spatial Ergodic Theorem shows that |^oo Pi \—\fn, \fn\ d \ grows proportionally to n d/2 . 
Hence we get 

PcvdXm = 0) > n>\, (2.5) 



n 



with C(to) > a.s. on the set {0 S ffco}- Note that, in d = 2,3, this complements nicely 
the "universal" upper bounds derived in [7|. 

The remainder of this paper is devoted to the proof of Theorem 12.11 The main line 
of attack is similar to the above ID solution: We define a harmonic coordinate <p w — an 
analogue of (|1.6|) — and then prove an a.s. invariance principle for 

ti-+<p a (X [nti )/Vn (2.6) 



RANDOM WALK AMONG RANDOM CONDUCTANCES 



5 



along the martingale argument sketched before. The difficulty comes with showing the 
sublinearity of the corrector, 

<p u (x) -X = o(x), \x\ -> oo. (2.7) 

As in Berger and Biskup |6], sublinearity can be proved directly along coordinate direc- 
tions by soft ergodic-theory arguments. The crux is to extend this to a bound throughout 
d -dimensional boxes. 

Following the d > 3 proof of |6], the bound along coordinate axes readily implies 
sublinearity on average, meaning that the set of x where \f w (x) — x\ exceeds e\x\ has 
zero density. The extension of sublinearity on average to pointwise sublinearity is the 
main novel part of the proof which, unfortunately, still makes non-trivial use of the 
"heat-kernel technology." A heat-kernel upper bound of the form \\.9) would do but, 
to minimize the extraneous input, we show that it suffices to have a diffusive bound for 
the expected displacement of the walk from its starting position. This step still requires 
detailed control of isoperimetry and volume growth as well as the comparison of the 
graph-theoretic and Euclidean distances, but it avoids many spurious calculations that 
are needed for the full-fledged heat-kernel estimates. 

Of course, the required isoperimetric inequalities may not be true on because of 
the presence of "weak" bonds. As in [7 J we circumvent this by observing the random 
walk on the set of sites that have a connection to infinity by bonds with uniformly positive 
conductances. Specifically we pick a > and let c &c X1A denote the set of sites in Z rf that 
are connected to infinity by a path whose edges obey coj, > a. Here we note: 

Proposition 2.3 Let d > 2 and p = P(a?2, > 0) > p c {d). Then there exists c(p,d) > such 
that if a. satisfies 

F(co h > a) > p c (d) (2.8) 

and 

P(0 < co h < a) < c{p,d) (2.9) 

then c €c aA is nonempty and \ ^oo^ has only finite components a.s. In fact, if^(x) is the set 
of sites (possibly empty) in the finite component of^^ \ ^co^ containing x, then 

F(x G <^oo & diami^(x) > n) < Ce^ n , n > 1, (2.10) 

for some C < oo and n > 0. Here "diam" is the diameter in the distance on Z d . 

The restriction of (po to ^oo^ is still harmonic, but with respect to a walk that can "jump 
the holes" of ffco^- A discrete-time version of this walk was utilized heavily in [7J; for the 
purposes of this paper it will be more convenient to work with its continuous-time coun- 
terpart Y = (Yf)f>o- Explicitly, sample a path of the random walk X = (X n ) from P W/ o 
and denote by T\, Ti,... the time intervals between successive visits of X to ^oo,a. These 
are defined recursively by 

T j+1 := inf {n > 1 : X Tl+ ... +IH , ; S (2.11) 

with To = 0. For each x,y G ^co^, let 

& xy := Pa,,x(X Tl = y) (2.12) 
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and define the operator 

(/&>/) 00 := E u xy [f{y)-f{x)]. (2.13) 

The continuous-time random walk Y is a Markov process with this generator; alterna- 
tively take the standard Poisson process (Nt)t>o with jump-rate one and set 

Y t :=X Tl+ ,.. +w (2.14) 

Note that, while Y may jump "over the holes" of c ra C o A , Proposition 12.31 ensures that 
all of its jumps are finite. The counting measure on ^oo,a is still invariant for this ran- 
dom walk, C& is self-adjoint on the corresponding space of square integrable functions 
and cp w = on tf^a (see Lemma 1572]) . 

The skeleton of the proof is condensed into the following statement whose proof, and 
adaptation to the present situation, is the main novel part of this note: 

Theorem 2.4 Fix cc as in J2.gW2.9D and suppose xp w : c € oaA —> R rf is a function and 9 > is a 
number such that the following holds for a.e. to: 

(1) (Harmonicity) If cp w {x) := x + tp w (x), then Cat q>w = on Ifcojc 

(2) (Sublinearity on average) For every e > 0, 

\ 

l™—d E %«(*)!>«.}= 0. (2.15) 
\x\<n 

(3) (Polynomial growth) 

v IV'wWI n /nifl 

lim max — — = 0. (2.16) 

|x|<n 

Let Y = ( Yf ) foe £/ze continuous-time random walk on ^oo A with generator cffl and suppose also: 

(4) (Diffusive upper bounds) For a deterministic sequence b n = o(n 2 ) and a.e. to, 

Eu>,x Yj — x . 
sup max sup t= < oo (2.17) 



|*| <n 



and 



Then for almost every oo, 



sup max sup t d/2 P a!/X (Y t = x) < oo. (2.18) 

n>l xe< ^~,« t>b n 
\x\<n 



lim max HM^M = o. (2.19) 

n^oo xG^ff n 
\x\<n 
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This result — with rp w playing the role of the corrector — shows that (p w {x) — x = 
o(x) on ^oo, a - This readily extends to sublinearity on ^ by the maximum principle 
applied to (p w on the finite components of "^o \ ^00,0: and using that the component sizes 
obey a polylogarithmic upper bound. The assumptions (1-3) are known to hold for the 
corrector of the supercritical bond-percolation cluster and the proof applies, with minor 
modifications, to the present case as well. The crux is to prove (|2.17I - [2T8)) which is 
where we need to borrow ideas from the "heat-kernel technology." For our purposes it 
will suffice to take b n = n in part (4). 

We remark that the outline strategy of proof extends rather seamlessly to other (tran- 
slation-invariant, ergodic) conductance distributions with conductances bounded from 
above. Of course, one has to assume a number of specific properties for the "strong" 
component < ^' oaA that, in the i.i.d. case, we are able to check explicitly. 

The plan of the rest of this paper is a follows: Sect.|3]is devoted to some basic percola- 
tion estimates needed in the rest of the paper. In Sect. |4] we define and prove some prop- 
erties of the corrector which is a random function marking the difference between the 
harmonic coordinate (pu,(x) and the geometric coordinate x. In Sect. |5] we establish the 
a.s. sublinearity of the corrector as stated in Theorem l2.4l subject to the diffusive bounds 
(|2.17I - [2TT8)| . Then we assemble all facts into the proof of Theorem 12. II Finally, in Sect. 
we adapt some arguments from Barlow (2) to prove (|2.17I - |2T8|) ; first in rather general 
Propositions 16. 1 1 and 16.21 and then for the case at hand. 

3. Percolation estimates 

In this section we provide a proof of Proposition 12.31 and also of a lemma dealing with 
the maximal distance the random walk Y can travel in a given number of steps. We will 
need to work with the "static" renormalization (cf Grimmett [17, Section 7.4]) whose 
salient features we will now recall. The underlying ideas go back to the work of Kesten 
and Zhang |IT9| , Grimmett and Marstrand [T8| and Antal and Pisztora (TJ. 

We say that an edge b is occupied if co\, > 0. Consider the lattice cubes 

B L (x) :=x+[0 r L] d nZ d (3.1) 

and 

B 3L {x) :=x+[-L,2L] rf nZ rf (3.2) 

and note that Bsl(x) consists of 3 d copies of £>l(*) that share only sites on their adjacent 
boundaries. Let Gl{x) be the "good event" — whose occurrence designates Bi(Lx) to 
be a "good block" — which is the set of configurations such that: 

(1) For each neighbor y of x, the side of the block Bi(Ly) adjacent to Bi(Lx) is con- 
nected to the opposite side of Bi(Ly) by an occupied path. 

(2) Any two occupied paths connecting Bi(Lx) to the boundary of B^i(Lx) are con- 
nected by an occupied path using only edges with both endpoints in B^Lx). 

The sheer existence of infinite cluster implies that (1) occurs with high probability once L 
is large (see Grimmett fllTl Theorem 8.97]) while the situation in (2) occurs with large 
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probability once there is percolation in half space (see Grimmett fl~7l Lemma 7.89]). It 
follows that 

P(G L (z)) —> 1 (3-3) 

L — >oo 

whenever ¥(cOj, > 0) > p c (d). A crucial consequence of the above conditions is that, 
if Gl{x) and Gi(y) occur for neighboring sites x,y G Z d , then the largest connected 
components in B^i(Lx) and Bo,i(Ly) — sometimes referred to as spanning clusters — are 
connected. Thus, if Gi(x) occurs for all x along an infinite path on Z d , the corresponding 
spanning clusters are subgraphs of ^o- 

A minor complication is that the events {Gi(x) : x G Z d } are not independent. How- 
ever, they are 4-dependent in the sense that if (x, ) and (y,-) are such that |x, — yA > 4 
for each i and j, then the families {Gl (*,■)} and {Gl(i//)} are independent. By the main 
result of Liggett, Schonmann and Stacey ll2il Theorem 0.0] (cf fl7l Theorem 7.65]) the in- 
dicators {1g l (x) : x £ Z ri }, regarded as a random process on Z d , stochastically dominate 
i.i.d. Bernoulli random variables whose density (of ones) tends to one as L — > oo. 

Proof of Proposition 12.31 In d = 2 the proof is actually very simple because it suffices to 
choose a such that (|2.8[) holds. Then \ c ^ OOA C Z 2 \ c tf C o A has only finite (subcritical) 
components whose diameter has exponential tails (|2.10|) by, e.g., [171 Theorem 6.10]. 

To handle general dimensions we will have to invoke the above "static" renormaliza- 
tion. Let Gi(x) be as above and consider the event Gi A (x) where we in addition require 
that CO}, (0, a) for every edge with both endpoints in Bo, L (Lx) . Clearly, 

lim limP(G La (z)) = 1. (3.4) 

Using the aforementioned domination by site percolation, and adjusting L and a we can 
thus ensure that, with probability one, the set 

{x e Z d : G LA (x) occurs} (3.5) 

has a unique infinite component Coo, whose complement has only finite components. 
Moreover, if 3(0) is the finite connected component of Z d \ Coo containing the origin, 
then a standard Peierls argument yields 

P(diamS(0) > n) < e^ n (3.6) 

for some £ > 0. To prove (|2.10|) , it suffices to show that 

^(0) C U B L {Lx) (3.7) 

xeS(o) 

once diam J^(0) > 3L. Indeed, then diam 3?(0) < L diam S (0) and so implies dZTOl 
with t] :=£/l and C := e 3L '?. 

To prove (|3.7|) , pick z E ,^(0) and let x be such that z G Bi(Lx). It suffices to show 
that if Gi A (x) occurs, then x is not adjacent to an infinite component in (|3.5|) . Assuming 
that x is adjacent to such a component, the fact that the spanning clusters in adjecent 
"good blocks" are connected and thus contained in ( €< X>A implies 

tfoo A D B L (Lx) ^0. (3.8) 
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But then Bi(Lx) is intersected by two "large" components, ^oo,a and ^(0), of edges 
with coi > a. (This is where we need diam^(O) > 3L.) If these components are joined 
by an occupied path — i.e., a path of edges with cuf, > — within Bt,i(Lx), then B^{Lx) 
contains a "weak" bond and so Gi A fails. In the absence of such a path the require- 
ment (2) in the definition of Gi{x) is not satisfied and so Gi A (x) fails too. □ 

Let d(x,y) be the "Markov distance" on V = < rfco,a, i.e., the minimal number of jumps 
the random walk Y = (Yt) needs to make to get from x to y. Note that d(x,y) could be 
quite smaller than the graph- theoretic distance on Itfao^ and /or the Euclidean distance. 
(The latter distances are known to be comparable, see Antal and Pisztora [1].) To control 
the volume growth for the Markov graph of the random walk Y we will need to know 
that d(x,y) is nevertheless comparable with the Euclidean distance \x — y|: 

Lemma 3.1 There exists q > and for each 7 > there is a. > obeying (12. 8142.91 ) and C < 00 
such that 

P(0,x e ^oo A & d(0,x) < q\x\ ) < Ce-tW, x e Z d . (3.9) 

Proof. Suppose a is as in the proof of Proposition [231 Let (n x ) be independent Bernoulli 
that dominate the indicators 1g Lk from below and let Coo be the unique infinite compo- 
nent of the set {x S Z rf : rj x = 1}. We may "wire" the "holes" of Coo by putting an edge 
between every pair of sites on the external boundary of each finite component of Z rf \ Coo; 
we use d'(0, x) to denote the distance between and x on the induced graph. The pro- 
cesses rj and (1q l t x \) can be coupled so that each connected component of ^00 \ ^oo,a 
with diameter exceeding 3L is "covered" by a finite component of Z d \ Coo, cf (|3.7|) . As 
is easy to check, this implies 

d(0,x) > d'(0,x') and \x'\ > hx\ -1 (3.10) 

whenever x S Bi(Lx'). It thus suffices to show the above bound for distance d'(0, x'). 

Let p = pi,u be the parameter of the Bernoulli distribution and recall that p can be 
made as close to one as desired by adjusting L and a. Let zq = 0,zi, . . . ,z n = x be a 
nearest-neighbor path on Z d . Let S(z,) be the unique finite component of Z rf \ Coo that 
contains z ; - — if z, S Coo, we have S(z;) = 0. Define 

£(z ,...,z n ) := EdiamS(z ! )(n i {z^s( 2l )})- ( 3 - n ) 

(=0 V ;<j 7 

We claim that for each A > we can adjust p so that 

Ee A,(z ° Zn) < e" (3.12) 

for all n > 1 and all paths as above. To verify this we note that the components con- 
tributing to £(zo, ... ,Zn) are distance at least one from one another. So conditioning on 
all but the last component, and the sites in the ultimate vicinity, we may use the Peierls 
argument to estimate the conditional expectation of e Adiam S(z«) ]-,y^ sa ^ e i (We are using 
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also that diam S(z„) is smaller than the boundary of S(z n ).) Proceeding by induction n 
times, (|3.12|) follows. 

For any given 7 > we can adjust p so that (|3.12|) holds for 

A:=2(l + log(2d)+7) (3.13) 

As the number of nearest-neighbor paths (zo = 0, . . . ,z n = x) is bounded by (2d) n , an 
exponential Chebyshev estimate then shows 

p(3(zo = 0,...,z n =xM(zo,...,z n ) > |) <e-T". (3.14) 

But if (zo = 0,..., z n = x) is the shortest nearest-neighbor interpolation of a path that 
achieves d'(0, x), then 

d'(0,x) >n-£(z Q ,...,z n ). (3.15) 
Since, trivially, | x| < n, we deduce 

P(d'(0,x) < < e^l (3.16) 
as desired. □ 



4. Corrector 

The purpose of this section is to define, and prove some properties of, the corrector 
X{w, x) := (pco{x) — x. This object could be defined probabilistically by the limit 

X(to,x) = lim (E W/X (X„) - E U/0 (X„)) - x, (4.1) 

unfortunately at this moment we seem to have no direct (probabilistic) argument show- 
ing that the limit exists. The traditional definition of the corrector involves spectral cal- 
culus (Kipnis and Varadhan |20J); we will invoke a projection construction from Mathieu 
and Piatnitski 1125 II (see also Giacomin, Olla and Spohn |15|). 

Let P be an i.i.d. law on (Q, JF) where Q := [0, l] Brf and & is the natural product 
tj-algebra. Let x x : O — > Q denote the shift by x, i.e., 

\Tzte)xy '■= <^>x+z,y+z (4.2) 

and note that P o t^ 1 = P for all x £ Z d . Recall that is the infinite connected 
component of edges with oo\, > and, for a > 0, let ¥?oo A denote the set of sites connected 
to infinity by edges with co b > a. If P(0 £ ^00 A ) > 0, let 

P a (-):=P(-|0£^ OO/ «) (4.3) 

and let E a be the corresponding expectation. Given co £ Q and sites x,y £ ^ca^co), 

let di^^ (x, y) denote the graph distance between x and y as measured on ( t? C o A . (Note this 
is distinct from the Markov distance d(x,y) discussed, e.g., in Lemma |3T] ) We will also 
use Leo to denote the generator of the continuous-time version of the walk X, i.e., 

(£*/)(*) :=i E "vl/Cy) -/(*)]• ( 4 - 4 ) 

y: \y-x\=l 
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The following theorem summarizes all relevant properties of the corrector: 

Theorem 4.1 Suppose P(0 G ^co) > 0. There exists a function %: Q x Z d — > K d suc/z f/wf 
the following holds Fq-u.s.: 

(1) (Gradient field) x(0,co) = and, for all x,y G < ^ o(<^ 1 ), 

x(u>,x)-x(<*>,y)=x(Ty< v 'X-y)- ( 4 - 5 ) 

(2) (Harmonicity) q> w (x) := x + ^;(o; / x) obeys C w q> w = 0. 

(3) (Square integrability) There is a constant C = C(a) < co swcfr that for all x,y G Z d 
satisfying \x — y\ = 1, 

E«(k(-,y) -^(-^)| 2 ^yl{xG^}) <C (4.6) 

Let a > Obe such that P(0 G ^o,*) > 0. Tfren we aZso /zaye: 

(4) (Polynomial growth) For every 6 > d, a.s., 

l im max l*(^ x )l = o. (4.7) 

(5) (Zero mean wncfer random shifts) Let Z : Q — > Z d be a random variable such that 

(a) Z(co) G ^^(w)/ 

(b) P« is preserved by co i— » Tz^^o?), 

(c) E a (dL fl) (0 / Z(o;))'?) < co for some q > 3d. 
Then x(-,Z(-)) G L x {Cl,& ,V a) and 

E«[x(-,Z(-))] =0. (4.8) 

As noted before, to construct the corrector we will invoke a projection argument. Ab- 
breviate L 2 (Q) = L 2 (Q, Po) and let B := . . . ,e^} be the set of coordinate vectors. 
Consider the space L 2 (Q x B) of square integrable functions u : O x B — > K d equipped 
with the inner product 

(«,w) :=E (X>(o;,&)-i7(a;,&)aA (4-9) 

b<=B 

We may interpret M G L 2 (Q x B) as a flow by putting u(cv, —b) = —u(X-\,U), b). Some, 
but not all, elements of L 2 (Q x B) can be obtained as gradients of local functions, where 
the gradient V is the map L 2 (Q) — » L 2 (Q x £>) defined by 

(V(p)(cv,b) :=<poT b (co) -<p(cv). (4.10) 

Let Ly denote the closure of the set of gradients of all local functions — i.e., those de- 
pending only on the portion of a; in a finite subset of Z rf — and note the following 
orthogonal decomposition L 2 (Q x B) - Ly © (Ly)" 1 . 
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The elements of (Ly )-*- can be characterized using the concept of divergence, which for 
u : Q x B — > R d is the function div m : Q — > R rf defined by 

divu(o;) := ^ [coi,u(co, b) — oj^uix^w, &)]. (4.11) 

foGB 

Using the interpretation of w as a flow, div w is simply the net flow out of the origin. The 
characterization of (Ly)- 1 is now as follows: 

Lemma 4.2 u £ (Ly)- 1 if and only if divw(a;) = 0/or Po-fl.e. a>. 

Proof. Let w e L 2 (Q x B) and let ^ G L 2 (Q) be a local function. A direct calculation and 
the fact that <x>_\, = (T_ b co) b yield 

(u,V<p) = -E ((p(to) divu(a;)). (4.12) 

If w G (Ly) -1 , then divw integrates to zero against all local functions. Since these are 
dense in L 2 (Q), we have div u = a.s. □ 

It is easy to check that every u G Ly is curl-free in the sense that for any oriented loop 
(xo, Xi,...,x n ) on ^^(co) with x n = xq we have 

n-l 

u(r Xj co, Xj+i - Xj) = 0. (4.13) 

j=0 

On the other hand, every u : O x B — > K rf which is curl-free can be integrated into a 
unique function </>: Q x ^co(-) — » IR d such that 

n-l 

<£(o;, x) = £ «(T X; a>, x ;+ i - x ; -) (4.14) 

holds for any path (xo, . . . , x n ) on ^(o;) with xo = and x„ = x. This function will 
automatically satisfy the shift-covariance property 

<p(co,x) — (p(co,y) = <p{x y co,x — y), x,y e ^(w). (4.15) 

We will denote the space of such functions H(C1 x Z d ). To denote the fact that <p is 
assembled from the shifts of u, we will write 

u = grad<^, (4.16) 

i.e./'grad" is a map from H(C1 x Z d ) to functions Q x B — » R d that takes a function 
</> G ft(Q x Z d ) and assigns to it the collection of values {</>(•, &) - <j>(-, 0) : fee B}. 

Lemma 4.3 Let <£> £ H(Cl x Z d ) fee suc/z that grad G (^v) -1 - Then <p is (discrete) harmonic 
for the random walk on i.e., for Po-fl.e. to and all x G ^(to), 

(C w cp)(cv,x)=0. (4.17) 
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Proof. Our definition of divergence is such that "div grad = 2dC w " holds. Lemma 
implies that u G (Ly) 1 " if and only if div u = 0, which is equivalent to (C w (p) (to, 0) = 0. 
By translation covariance, this extends to all sites in 'tfoo- □ 

Proof of Theorem WTi l -3 ) . Consider the function cp(co,x) := x and let u := gradcp. Clearly 
u G L 2 (Q x B). Let G S L v be the orthogonal projection of — u onto L v and define 
X G H(fl x Z rf ) to be the unique function such that 

G = grad* and x(-,0) =0. (4.18) 

This definition immediately implies (|4.5|) , while the definition of the inner product on 
L 2 (Q x B) directly yields (|4.6|) . Since u projects to — G on L v , we have u + G G (Ly) ± . 
But u + G = grad [x + ^(w, x)] and so, by Lemma l4~3l ihi + /C(<^/ is harmonic with 
respect to C w . □ 

Remark 4.4 We note that the corrector is actually uniquely determined by properties 
(1-3) of Theorem l4.ll In fact, x + % spans the orthogonal complement of L v in the space 
of shift-covariant functions. See Biskup and Spohn 

For the remaining parts of Theorem 14. 1 1 we will need to work on ^oo,a- However, we 
do not yet need the full power of Proposition l2.3t it suffices to note that %x> A has the law 
of a supercritical percolation cluster. 

Proof of Theorem \4l\ 4). Let 9 > d and abbreviate 

R n := max \x{u,x)\. (4.19) 

|x|<n 

By Theorem 1.1 of Antal and Pisztora [1], 

d (ffi Vn x) 

A(cv) := sup ,7 } < oo, P a -a.s., (4.20) 



X 



and so it suffices to show that R n /n d — ► on {A(o;) < A} for every A < oo. But on 
{A(o>) < A} every x G ^oo,a with |x| < n can be reached by a path on ^oo,a that does not 
leave [— An, \n] d and so, on {A(a>) < A}, 



K)3 < £ ^J^^\x(co f x + b)- X (w f x)\. (4.21) 



|x|<Arc 



Invoking the bound (|4.6|) we then get 

l|RKl{AM<A}l|2<Cn' i (4.22) 

for some constant C = C(a, A, d) < oo. Applying Chebyshev's inequality and sum- 
ming n over powers of 2 then yields R n /n —> a.s. on {A(a>) < A}. □ 
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Proof of Theorem \4~T\ 5). Let Z be a random variable satisfying the properties (a-c). By the 
fact that G € Ly, there exists a sequence ip n € L 2 (Q) such that 

ipn °T x -xp n — ► x(;x) in L 2 (C1 X B). (4.23) 

Abbreviate Xn{u,x) = ^« ° t x (o;) — tp n {<-^) and without loss of generality assume that 
Xn{-, x ) x{'' x ) almost surely. 

By the fact that Z is P a -preserving we have 'E a (x n {-,Z(-))) = as soon as we can 
show that Xn {•, Z(-)) 6 L 1 (Q) . It thus suffices to prove that 

Xn{;Z(-)) — x(.,Z(.)) inL^Q). (4.24) 
Abbreviate K(to) := dffl (0, Z(a>)) and note that, as in part (4), 



\Xn(cv,Z(cv))\< £ "£j^M±t\ Xn (co,x + b)-Xn(co,x)\. (4.25) 

\x\<K(w) 



The quantities 

y/^x+b I Xn (co, x + b)-xn (co, x) \ 1{ X& ?^} (4.26) 

are bounded in L 2 , uniformly in x, b and n, and assumption (c) tells us that K £ LI 
for some q > 3d. Ordering the edges in according to their distance from the origin, 
Lemma 4.5 of Berger and Biskup [6] with the specific choices 

p : = 2, s := <7/ d and N := d(2K + l) d (4.27) 

(note that N e L S (Q)) implies that for some r > 1, 

sup|U n (-,Z(-))|| r < oo. (4.28) 



Hence, the family {/£«(•, Z(-))} is uniformly integrable and l|4.24|) thus follows by the 
fact that Xn('/Z(-)) converge almost surely. □ 

Remark 4.5 It it worth pointing out that the proof of properties (1-3) extends nearly 
verbatim to the setting with arbitrary conductances and arbitrary long jumps (i.e., the 
case when B is simply all of Z d ). One only needs that x is in L 2 (Q x £>), i.e., 

e( £ cv , x \xA < oo. (4.29) 
The proof of (4-5) seems to require additional (and somewhat unwieldy) conditions. 



5. Convergence to Brownian motion 



Here we will prove Theorem [2Tj We commence by establishing the conclusion of Theo- 
rem !2.4l whose proof draws on an idea, suggested to us by Yuval Peres, that sublinearity 
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on average plus heat kernel upper bounds imply pointwise sublinearity. We have re- 
duced the extraneous input from heat-kernel technology to the assumptions (|2.17j - lzl8l) . 
These imply heat-kernel upper bounds but generally require less work to prove. 

The main technical part of Theorem 12. II is encapsulated into the following lemma: 
Lemma 5.1 Abusing the notation from (14.191 ) slightly, let 

R n := max (5.1) 

\x\<n 

Under the conditions (1,2,4) of Theorem \2.4\ for each e > and 5 > 0, there exists an a.s. finite 
random variable no = no(co,e,6) such that 

R n <en + 5R 3n . n > n . (5.2) 

Before we prove this, let us see how this and (|2.16|) imply (|2.19l l. 

Proof of Theorem 1241 Suppose that R„/n /» and pick c with < c < limsup^^ R„/n. 
Let 9 be is as in (|2.16|) and choose 



c 

2 " '~ 3 0H 



e := - and S := t^xt- (^.3) 



Note that then c' — e > 3 e 5c' for all c' > c. If R n > cn — which happens for infinitely 
many n's — and n > n§, then d5.2| | implies 

R3n > < —^n > 3 cn (5.4) 
d 

and, inductively, R 3 k n > 3 k0 cn. However, that contradicts (|2.16|l by which R 3 k n /3 ke —* 
as k — *• oo (with n fixed). □ 

The idea underlying Lemma [5.11 is simple: We run a continuous-time random walk 
(Yf) for time f = o(n 2 ) starting from the maximizer of R n and apply the harmonicity of 
x i — ► x -\- ip w {x) to derive an estimate on the expectation of tp(Y t ). The right-hand side 
of (J5.2|) expresses two characteristic situations that may occur at time t: Either we have 
l^f (Yf)| < en — which, by "sublinearity on average," happens with overwhelming 
probability — or Y will not yet have left the box [— 3n,3n] d and so ip w (Yt) < R3n- The 
point is to show that these are the dominating strategies. 

Proof of Lemma |5J1 Fix e, 5 > and let C\ = C\(a>) and C2 = C2(co) denote the suprema 
in (|2.17[) and (|2.18|) , respectively. Let z be the site where the maximum R n is achieved 
and denote 

G n ■= [ x e ^y. \x\ < n, \%[x)\ > \en). (5.5) 

Let Y = (Y f ) be a continuous-time random walk on c to caA with expectation for the walk 
started at z denoted by E W/Z . Define the stopping time 

S n := inf{£ > 0: \Y t -z| > 2n) (5.6) 
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and note that, in light of Proposition |2.3t we have \YtAS„ ~ A < 3ft for all f > provided 
n > tii(co) where fi\{co) < co a.s. The harmonicity of x i— » x + ipu{x) and the Optional 
Stopping Theorem yield 

Rn < E W/Z \ip w (VtAS n ) + YtAS,, - z|- (5.7) 

Restricting to t satisfying 

t > b 4n/ (5.8) 

where b n = o(n 2 ) is the sequence in part (4) of Theorem I2.4I we will now estimate the 
expectation separately on {S n < t} and {S„ > t}. 

On the event {S n < t }, the absolute value in the expectation can simply be bounded 
by R3„ + 3ft. To estimate the probability of S„ < t we decompose according to whether 
\Ylt — z| > § ft or not. For the former, (|5.8[) and \2.17) imply 



n r\ys i 3 \ Eo;,2 P^2t — Z 2 V2f 

Po7,z( Ya -Z > 2 n ) ^ ^ ^ a C l • ( 5 - 9 ) 

|n 3ft 

For the latter we invoke the inclusion 

{\Y 2t -z\< |n} n {S„ < f} C {\Y 2t - Y Sn \ > \n) n {S n < t} (5.10) 

and note that 2t — S n G [t, 2t], (|5.8|l and (|2.17|) give us similarly 

2 

£W(|Y S - ac| > n /i) < -CiVlt when x := Y s „ and s := It - S n . (5.11) 

From the Strong Markov Property we thus conclude that this serves also as a bound for 
Pw,z(Sn < t,\Y2t — z\ > |ft). Combining both parts and using |v2 < 4 we thus have 

P W(Z (S„ < t) < (5.12) 
n 

The S„ < £ part of the expectation 05.71 ) is bounded by R^n + 3ft times as much. 
On the event {S„ > t), the expectation in {5.7) is bounded by 

^w,z{\^pw(Yt)\ l{S„>f} 

)+E C0 , z \Y t -z\. (5.13) 

The second term on the right-hand side is then less than Qvf provided t >b n . The first 
term is estimated depending on whether Y t £ Gin or not: 

E„, z {\fMt)\ l{s„>t}) < \en + R 3n P W/Z (Y t G ^ 2n ). (5.14) 
For the probability of Y t € &i n we get 

P w ,z(Y G G 2 „) = £ P W/Z (Y f = x) (5.15) 

which, in light of the Cauchy-Schwarz estimate 

Pu,,z{Y t = xf < P w , z (Y t = z)P 0JiX {Y t = x) (5.16) 
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and the definition of C2, is further estimated by 

PuM e^2«) <c 2 ^. (5.17) 



From the above calculations we conclude that 



Rn < (R3n+3n)^^ + C 1 Vt + len + R 3n C 2 



2 J " z 



(5.18) 



Since |^2n| = o(n d ) as n — > oo, by (|2.15[) we can choose t := £n 2 with £ > so small that 
05.81 ) applies and H5.2D holds for the given e and J once n is sufficiently large. □ 

We now proceed to prove convergence of the random walk X = (X„) to Brownian 
motion. Most of the ideas are drawn directly from Berger and Biskup [6] so we stay 
rather brief. We will frequently work on the truncated infinite component *«foo ; « and the 
corresponding restriction of the random walk; cf <|2.1H - f27l3|) . We assume throughout 
that a is such that (IZ8H2T91 hold. 

Lemma 5.2 Let % be the corrector on ffoo- Then <p w {x) := x + #(a;,x) is harmonic for the 
random walk observed only on ^oo,a, is., 

4, fl W*)=o, Vie V (5.19) 

Proof. We have 

OcSVoOC*) = Eu,*(fw(XTi)) - <Pco(x) (5.20) 

But X n is confined to a finite component of ^oo \ ^oo, a for n S [0, Ti], and so f w (X n ) 
is bounded. Since (^o,(X„)) is a martingale and Ti is an a.s. finite stopping time, the 
Optional Stopping Theorem tells us E WrX cp (V (XT 1 ) = cp w (x) . □ 

Next we recall the proof of sublinearity of the corrector along coordinate directions: 

Lemma 5.3 For co G {0 G ^oo, a }, let (x n ((v)) ne z mark the intersections o/^»,a and one of 
the coordinate axis so that xq(co) = 0. Then 

lim X&*W = o, P a - a .s. (5.21) 

n— >oo n 

Proof. Let T x be the "shift by x" on O and let cr(to) := x Xy i w \{co) denote the "induced" 
shift. Standard arguments (cf [6 , Theorem 3.2]) prove that a is P a preserving and ergodic. 
Moreover, 

E K (d<£\0, x t (to)y) <oo, p<oo, (5.22) 

by [6, Lemma 4.3] (based on Antal and Pisztora HI). Define := x( w > x l( w ))- 

Theorem 14. If 5) tells us that 

YGL 1 ^) and E a Y(o;) = 0. (5.23) 



18 



M. BISKUP AND T. PRESCOTT 



But the gradient property of % implies 

^^)) = lg Yo ^) (5.24) 
n n k=0 

and so the left-hand side tends to zero a.s. by the Pointwise Ergodic Theorem. □ 
We will also need sublinearity of the corrector on average: 

Lemma 5.4 For each e > and F a -a.e. to: 

\ 

l™~d L 1 {WMIM =ft ( 5 - 25 ) 

\x\<n 

Proof. This follows from Lemma 1531 exactly as [6, Theorem 5.4]. □ 

Remark 5.5 The proof of [6, Theorem 5.4] makes a convenient use of separate ergod- 
icity (i.e., that with respect to shifts only in one of the coordinate directions). This is 
sufficient for i.i.d. environments as considered in the present situation. However, it is 
not hard to come up with a modification of the proof that covers general ergodic envi- 
ronments as well (Biskup and Deuschel [8|). 

Finally, we will assert the validity of the bounds on the return probability and ex- 
pected displacement of the walk from Theorem |2.4| 

Lemma 5.6 Let (Y t ) denote the continuous-time random walk on %a A . Then the diffusive 
bounds (IZlTtiZlgD hold for F a -a.e. to. 

We will prove this lemma at the very end of Sect. [6] 

Proof of Theorem |ZT] Let a. be such that I|2.8I42.9|I hold and let x denote the corrector on 
as constructed in Theorem |4.1[ The crux of the proof is to show that x grows sublinearly 
with x, i.e., x(w,x) = o(\x\) a.s. 

By Lemmas I5.2l andl 5.4[ Theorem l4.1f 4) and Lemma I5T61 the corrector satisfies the con- 
ditions of Theorem l2.4l It follows that x is sublinear on ^oo, a as stated in (|2.19[) . However, 
by H2.10I ) the largest component of \ in a box [— 2n, 2n) is less than C log n in di- 
ameter, for some random but finite C = C(co). Invoking the harmonicity of cp w on ^oo, 
the Optional Stopping Theorem gives 

max|^(o;, x)\ < max \x(u, x)\ + 2C(a?) log(2n), (5.26) 

\x\<n \x\<n 

whereby we deduce that x is sublinear on as well. 

Having proved the sublinearity of x on ^oo, we proceed as in the d = 2 proof of [6]. 
Let fco(x) := x + x{°°/ x ) and abbreviate M n := <po,(X n ). Fix v £ M. d and define 

f K (a>) :=E a;/ o((v-M 1 ) 2 l { |^. Ml |> K} ). (5.27) 
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By Theorem I4TT3). f K G L^O, J^Po) for all K. Since the Markov chain on environ- 
ments, n i — > Tx„ (co), is ergodic (cf 16, Section 3]), we thus have 

\ E /k ° Tx* (w) — ► E /k, (5.28) 
n £j 

for Po-a.e. a; and P^o-a.e. path X = (Xjt) of the random walk. Using this for K := 
and K := Ca/w along with the monotonicity of K i— > verifies the conditions of the 
Lindeberg-Feller Martingale Functional CLT (e.g., Durrett fl3l Theorem 7.7.3]). Thereby 
we conclude that the random continuous function 

t ^ ^=(v ■ M [nt] + (nt - [nt\)v ■ (M [nt]+1 - M [nt] )) (5.29) 

converges weakly to Brownian motion with mean zero and covariance 

E /o = E £^o((v-M 1 ) 2 ). (5.30) 

This can be written as v • Dv where D is the matrix with coefficients 

D i4 := E E c<; ,o((g i • M a )(g; • Mi)). (5.31) 

Invoking the Cramer-Wold device (e.g., Durrett Ifl3l Theorem 2.9.5]) and the fact that 
continuity of a stochastic process in 1R d is implied by the continuity of its d one-dim- 
ensional projections we get that the linear interpolation of t i— » M^j / \fn scales to d- 
dimensional Brownian motion with covariance matrix D. The sublinearity of the correc- 
tor then ensures, as in [6, (6.11-6.13)], that 

Xn-M n = x(to,Xn)=o(\X n \)=o(\M H \) = o(Vn), (5.32) 

and so the same conclusion applies to t i— > B n (t) in H2.2D . 

The reflection symmetry of Po forces D to be diagonal; the rotation symmetry then 
ensures that D = { l /d)c 2 1 where 

a 1 ^EoE^olMil 2 (5.33) 

To see that the limiting process is not degenerate to zero we note that if we had a = 
then x(~' x ) = ~ x would hold a.s. for all x G Z d . But that is impossible since, as we 
proved above, x i— » %(-, x) is sublinear a.s. □ 

Remark 5.7 Note that, unlike the proofs in 11291 16l l25l , the above line of argument does 
not require a separate proof of tightness. In our approach, this comes rather automati- 
cally for the deformed random walk cp w (X n ) — via the (soft) stationarity argument (|5.28|) 
and the Martingale Functional CLT. Sublinearity of the corrector then extends it readily 
to the original random walk. 

Remark 5.8 We also wish to use the opportunity to correct an erroneous argument 
from 16]. There, at the end of the proof of Theorem 6.2 it is claimed that the expectation 
EoEa; ; o(Xi • x(Xi,co)) is zero. Unfortunately, this is false. In fact, we have 

E E^(X 1 -x(Xi,u)) = -E E a;/0 |x(X 1 ,a;)| 2 < 0. (5.34) 
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where the strict inequality assumes that P is non-degenerate. This shows 

EoE^olMal 2 = E E^o|X 1 | 2 -E E a , / o|;t(Xi / a;)| 2 < E Ea,,o|Xi| 2 . (5.35) 

Thus, once P is non-degenerate, the diffusion constant of the limiting Brownian motion 
is strictly smaller than the variance of the first step. 

A consequence of the above error for the proof of Theorem 6.2 in |6) is that it inval- 
idates one of the three listed arguments to prove that the limiting Brownian motion is 
non-degenerate. Fortunately, the remaining two arguments are correct. 



6. Heat kernel and expected distance 

Here we will derive the bounds (|2.17I - [2TT8)) and thus establish Lemma 15.61 Most of the 
derivation will be done for a general countable-state Markov chain; we will specialize 
to random walk among i.i.d. conductances at the very end of this section. The general 
ideas underlying these derivations are fairly standard and exist, in some form, in the 
literature. A novel aspect is the way we control the non-uniformity of volume-growth 
caused by local irregularities of the underlying graph; cf (|6.4|) and Lemma l63l l). A well 
informed reader may nevertheless wish to read only the statements of Propositions 16.11 
and 16.21 and then pass directly to the proof of Lemma IBT61 

Let V be a countable set and let {a xy ) X/ yev denote the collection of positive numbers 
with the following properties: For all x,y G V, 

a X y = a ¥ x and n(x) := £ a xy < oo. (6.1) 

yev 

Consider a continuous time Markov chain (Y f ) on V with the generator 

(£/) W :=^vE [/(y) " /(*)] • (6-2) 
a \ x > yev 

We use P x to denote the law of the chain started from x, and E x to denote the corre- 
sponding expectation. Consider a graph G = (V, E) where E is the set of all pairs (x,y) 
such that a xlJ > 0. Let d(x, y) denote the distance between x and y as measured on G. 

For each x G V, let B n (x) := {y G V: d(x,y) < n}. If A C V, we use Q(A, A c ) to 
denote the sum 

Q(A,A C ) := £ ^ «*y- (6-3) 

X £AyeA c 

Suppose that there are constants d > and v G (0, V2) such that, for some a > 0, 

C vol {x,a) := sup \ s d £ 7r(y) e - sd M < 00 (6.4) 



0<s<fl 



and 

Q so (x) := inf inf( ^ A '^J : A C B 2n {x), n{A) > n\ A connected) > 0. (6.5) 
I 7r(A)— J 
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Let V(e) C V denote the set of all x G V that are connected to infinity by a self-avoiding 
path (xq = x, x\, . . . ) with a Xi/XM > e for all i > 0. Suppose that 

a* := sup{e > 0: V(e) = V} > 0. (6.6) 
(Note that this does not require a xy be bounded away from zero.) 
The first observation is that the heat-kernel, defined by 

q t (x,y) := PX{Yt = y \ (6.7) 

can be bounded in terms of the isoperimetry constant Q so (x). Bounds of this form are 
well known and have been derived by e.g., Coulhon, Grigor 'yan and Pittet IfTOl for heat- 
kernel on manifolds, and by Lovasz and Kannan [22], Morris and Peres [23] and Goel, 
Montenegro and Tetali [TBI in the context of countable-state Markov chains. We will use 
the formulation for infinite graphs developed in Morris and Peres [23]. 

Proposition 6.1 There exists a constant c\ G (1, oo) depending only on d and a+ such that for 
t(x) := Ci[log(Cjso(x) Vci)]t=^ wehave 

sup sup q t {z,y) < a^j^- — , t>t(x). (6.8) 

zeB t (x) yev 1 

Proof. We will first derive the corresponding bound for the discrete-time version of (Yf). 
Let P(x,y) := a xy /n(x) and define P := \ {1 + P). Let 

«.<*y> =- ^ <«) 

We claim that, for some absolute constant C\ and any z G B n (x), 



Ciso(^) 



-d 



q n (z,y) < Cl ^ , n > t(x). (6.10) 



To this end, let Q be the object Q for the Markov chain P and let 

<p(r) := inf{ Q^ C) : zr(A) < r, A C B 2k (x)}. (6.11) 

We claim that Theorem 2 of Morris and Peres [23] then implies that, for any e that satis- 
fies < e < [n(x) A n(y)Y l and 

n > 1 + / (6.12) 

we have q n {z,y) < e. To see this we have to check that the restriction A C B2 n {x) in the 
definition of <p(r), which is absent from the corresponding object in [23], causes no harm. 
First note that the Markov chain started at z G B n (x) will not leave £>2,i(x) by time n. 
Thus, we can modify the chain outside £>2n(x) arbitrarily. It is easy to come up with a 
modification that will effectively reduce the infimum in (|6.11|) to sets inside £>2n(*)- 
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It is well known (and easy to check) that the infimum in ( 16.111) can be restricted to 
connected sets A. Then (|6.5H6.6[) give us 

<pir) > \ {C iso (x)r- 1/d A a^) (6.13) 

where the extra half arises due the consideration of time-delayed chain P. The two 
regimes cross over at r n := (Q so (x) / a±) d n dv ; the integral is thus bounded by 

f" e J^<4lo g (^) + 2 <i C 1 „(,)-(l) 2/i . (6.14) 

J4(n(z)/\n( y )) r<p(r) 2 flj °V4fl*/ \eJ 

The first term splits into a harmless factor of order n 2l 'logn = o(n) and a term propor- 
tional to n 2v log Cjso(x). This is O(n) by n > t(x) where the (implicit) constant can be 
made as small as desired by choosing c\ sufficiently large. Setting 

e := c[C iso (x) 2 n]- d/2 (6.15) 

we can thus adjust the constant c in such a way that (|6.14|) is less than n — 1 for all 
n > t(x). Thereby d6.10l > follows. 

To extend the bound l|6.10|) to continuous time, we note that C = 2(P — 1). Thus if N t 
is Poisson with parameter 2£, then 

q t (z,y) = Ecj Nt (z,y). (6.16) 

But P(Nt < |f or Nt > 3t) is exponentially small in t and, in particular, much smaller 
than l|6.8[) for t > c\ log Q so (x) with c\ sufficiently large. As q t < (fl*) _1 , the N t ^ (§f, 3t) 
portion of the expectation in (|6.16|) is thus negligible. Once N t is constrained to the 
interval (f, 3f) the uniform bound (|6.10|) implies (|6.8[) . □ 

Our next item of business is a diffusive bound on the expected (graph-theoretical) 
distance traveled by the walk Y t by time t . As was noted by Bass [4] and Nash |27|, this 
can be derived from the above uniform bound on the heat-kernel assuming regularity 
of the volume growth. Our proof is an adaptation of an argument of Barlow |2|. 

Proposition 6.2 There exist constants Cz = Cz(d) and C3 = 03(d) such that the following 
holds: Let x G V and suppose A > and t(x) > 1 are numbers for which 

sup^ t (x,y) < -tjx, t > t(x), (6.17) 
yeV 1 

holds and let T(x) := \ (Aa*)- i/d V [t(x) log t(x)}. Then 

E x d(x,Y t ) < A'(x,t)Vt, t > T(x), (6.18) 

with A'(x,t) := c 2 + c 3 [logA + C vol (x,r 1/2 )]. 

Much of the proof boils down to the derivation of rather inconspicuous but deep 
relations (discovered by Nash |27| ) between the following quantities: 

M(x,t) := E x d(x,Y f ) =Y j n{y)q t {x r y)d{x r y) (6.19) 
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and 

Q(x,t) := -E x log q t (x,Y t ) = -Y^7i(y)q t (x r y)logq t (x,y). (6.20) 

y 

Note that q t (x, •) < implies Q(x,t) > log a*. 

Lemma 6.3 For all t > and a/Z x 6 V, 

(1) M(x, f) d > exp{-l - C vol (x,M(x, f)- 1 ) + Q(z, 0}/ 

(2) M'(x,f) 2 < Q'(x,£). 

Proof. Part (2) is identical to the proof of Lemma 3.3 in Barlow ||2| so we only need to 
prove part (1). We will follow the proof of [2, Lemma 3.3] except that we need to keep 
track of C vo i(x) in the calculation. Pick two numbers a > and b G R and note that 

wlogw + A« > -e~ A_1 . (6.21) 

This implies 

- Q(x, t) + aM{x, t)+b>~Y J Tr(y)e- b - 1 - ad{x 'y\ (6.22) 

y 

Using the definition of C vo i(x, a) and bounding e _1 < 1 we get 

- Q(x,t) +aM(x,t) +b > -C vo i(x,a)e- h a- d . (6.23) 

Now set e~ b := a d with a := M(x, f) -1 to get the result. □ 

These bounds imply the desired diffusive estimate on M(x, t): 

Proof of Proposition \6.2\ Suppose without loss of generality that M(x, t) > \/t, because 
otherwise there is nothing to prove. We follow the proof of [2, Proposition 3.4]. The key 
input is provided by the inequalities in Lemma 1631 Define the function 

L(f) := \(Q{x,t) +logA- i]0gt) (6.24) 
and note that L(t) > for t > t(x). Let 

f := {Aa+)- 2/d V sup{f > 0: L(t) < 0}. (6.25) 
We claim that M(x, t ) < y / dT(x). Indeed, when f = (Aa*)~ 2/(f then this follows by 

M(x, t ) <t = (Aa*)- 2/d < ^dT(x) (6.26) 

due to our choice of T(x). On the other hand, when Iq > (AaJ)~ 2 ^ d we use Lemma l6.3r 2), 
the Fundamental Theorem of Calculus and the Cauchy-Schwarz inequality to derive 

M(x,t ) < Vk[Q{x,t Q )-Q{x r Q)] V2 . (6.27) 
Since Q(x,0) > log a* and L(fo) = by continuity, we have 

, d \l/2 

M(x,to) < y/t ~( y -\ogt -lo S A-\oga±) < yjdt log t (6.28) 
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where we used that to > (Aa*)~ 2//rf implies log A + log a* > — |logfo- Since this 
implies to > 1, the condition to < £(x) shows that the right-hand side is again less 
than ^/dT{x). 

For t > to we have L(t) > 0. Lemma I63L 2) yields 

,- ft , J , x 1/2 

M(x,t) -M(x,f ) < Vd / f — + L'(s)) ds 



< [ (-^ = + L'(s) y / s / 2 )ds < Vldi+L(t)Vdi, 
J to v V 2s ' 



(6.29) 



where we used integration by parts and the positivity of L to derive the last inequal- 
ity. Now put this together with M(x,to) < \fd~t and apply Lemma I63U ), noting that 
Cy i(z,M(x, f) _1 ) < C V oi( z / £~ 1/2 ) by the assumption M(x, t) > \ft. Dividing out an 
overall factor \ft, we thus get 

[ Ae c vol (*,r^)j -i/d e - 1/i+L(f ) < 3v ^ + ^ L(f) _ (6 _ 30) 

This implies 

L{t) < c 2 + c 3 [logA + C vol (x,r 1/2 )] (6.31) 

for some constants c 2 and £3 depending only on d. Plugging this in (|6.29|) , we get the 
desired claim. □ 

We are now finally ready to complete the proof of our main theorem: 



Proof of Lemma 15^61 We will apply the above estimates to obtain the proof of the bounds 
(|2.17|42Tl"8)) . We use the following specific choices 

V := %o A , a xy := w xy , n[x) := 2d, and b„ := n. (6.32) 

As a* > oc, all required assumptions are satisfied. 

To prove (|2.18|) , we note that by Lemma 3.4 of Berger, Biskup, Hoffman and Kozma 
(based on the isoperimetric inequality for the supercritical bond-percolation cluster, cf 
Benjamini and Mossel [5], Rau [28. Proposition 1.2] or Section 5]) we have Q so (0) > 
a.s. Hence, Proposition 16.11 ensures that, for all z G ^00 A with |z| < t, 

t d/2 Pu,,z(Yt =z)< 2dc 1 C iso (0)- d (6.33) 

provided f exceeds some t\ depending on Q so (0) . From here (|2.18|) immediately follows. 
To prove (|2.17[) , we have to show that, a.s., 

sup max sup C vo i(z, t~ 1/2 ) < 00. (6.34) 

h>1 ze< ^~,« t>n 
\z\<n 

To this end we note that Lemma l3Tl implies that there is a.s. finite C = C(co) such that 
for all z,y G ^oo,^ with |z| < n and |z — y\ > Clogn, 

d{z,y)>q\z-y\. (6.35) 
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It follows that, once V 8 > C log n, for every z S with |z| < n we have 

£ e -«d(z, v ) < C4fl -rf + £ e -«c|z-yl < C5fl -^ (6.36) 

\y—z\>l/a 

where C4 and C5 are constants depending on d and Since V s = £ 1/2 > 1/" ^ logn, 
(|634|) follows. 

Once we have the uniform bound (|6.34|) , as well as the uniform bound (|6.17|l . Propo- 
sition [62] yields the a.s. inequality 

E W/Z d(z,Y t ) 

sup max sup — ; — -= < 00. (6.37) 

n>l ze< ^~,« t>n V £ 

|z|<n 

To convert d(z, Y ( ) into |z — Y f | in the expectation, we invoke (|6.35|) one more time. □ 
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